DOCUMENT RESUME 



ED 063 905                    24                    HE 003 224

AUTHOR       Linsky, Arnold S.; Straus, Murray A.
TITLE        Dimensions of Academic Competence: The Relationship
             of Classroom and Research Performance of College
             Faculty. Final Report.
INSTITUTION  New Hampshire Univ., Durham.
BUREAU NO    BR-0-A-045
PUB DATE     12 Jun 72
GRANT        OEG-1-70-000045-0014 (509)
NOTE         74p.

EDRS PRICE   MF-$0.65 HC-$3.29
DESCRIPTORS  *Class Size; *College Faculty; Educational Research;
             *Higher Education; *Professors; Research; *Teacher
             Evaluation



ABSTRACT 

This study determines empirically the relationship 
between college teachers' classroom performance and their involvement 
in research. The study is based on a national sample of approximately 
5,000 faculty members in 16 colleges. Ratings of classroom 
performance were derived from published course critiques and 
scholarship ratings are based on a weighted publication score or 
citation score. Findings show that there is little or no correlation 
between teaching ratings and scholarly activities and, because of 
this, that universities should recruit faculty who are strong in both 
areas. Further investigation found that (1) professors teaching upper 
division and smaller enrollment courses receive better evaluations 
than colleagues in lower division and larger courses; (2) teachers in 
the languages received highest teacher ratings, followed by 
humanities, other social sciences, professional schools, physical and 
biological sciences, sociology, and psychology; and (3) the
relationship between class enrollment and teacher ratings is 
curvilinear with very small and very large classes receiving highest 
ratings. (Author/HS) 






Final Report 



Project No. 0-A-045
Grant No. OEG-1-70-000045-0014 (509) 



Arnold S. Linsky and Murray A. Straus 
Department of Sociology and Anthropology 
Social Science Center 
University of New Hampshire 
Durham, N.H. 03824



DIMENSIONS OF ACADEMIC COMPETENCE: 

The Relationship of Classroom and Research Performance of College Faculty 



June, 1972 








U.S. DEPARTMENT OF HEALTH, EDUCATION, AND WELFARE 

Office of Education 

Regional Research Program 
Boston, Mass. 






Final Report 



Project No. 0-A-045 
Grant No. OEG-1-70-000045-0014 (509) 



DIMENSIONS OF ACADEMIC COMPETENCE: 

The Relationship of Classroom and Research Performance of College Faculty 



Arnold S. Linsky and Murray A. Straus 
Department of Sociology and Anthropology 
University of New Hampshire 
Durham, New Hampshire 



June 12, 1972 



The research reported herein was performed pursuant to a grant with 
the Office of Education, U.S. Department of Health, Education and 
Welfare. Contractors undertaking such projects under Government 
sponsorship are encouraged to express freely their professional 
judgment in the conduct of the project. Points of view or 
opinions stated do not, therefore, necessarily represent official 
Office of Education position or policy. 



U.S. DEPARTMENT OF 
HEALTH, EDUCATION, AND WELFARE 

Office of Education 
Regional Research Program 
Boston, Mass. 






ABSTRACT 



DIMENSIONS OF ACADEMIC COMPETENCE: 

The Relation of Classroom and Research Performance of College Faculty 

Arnold S. Linsky and Murray A. Straus 
University of New Hampshire 

This study determines empirically the relationship of college 
faculty's classroom performance and their involvement in research. 

The study is based upon a national sample of approximately 5,000 
faculty in 16 colleges. Ratings of classroom performance are derived 
from published "course critiques." Scholarship ratings are based on
a weighted "publication score" or "citation score." There appears
to be little or no correlation between teaching ratings and scholarly
activities.

One implication of the absence of relationship between teaching 
and research abilities is that universities that are sufficiently com- 
petitive should recruit faculty who are strong in both teaching and 
research since being good in one role is certainly no bar to being 
good in the other. 

In additional investigations we found that:

1) professors teaching upper division and smaller enrollment courses
received more favorable evaluations than colleagues in lower division
and larger courses;

2) teachers in the Languages received the highest teacher ratings,
followed by Humanities, "Other Social Sciences," Professional Schools,
Physical and Biological Sciences, and, last, Sociology and Psychology;

3) the relationship between class enrollment and teacher ratings
is curvilinear, with very small (under 30) and very large (over 300)
classes receiving the highest ratings.







TABLE OF CONTENTS 



INTRODUCTION

PROCEDURES

RESULTS

Paper No. 1 "Student Evaluations of Teaching: A Comparison of Sociology with Other Disciplines"

Paper No. 2 "Teaching Ratings by Discipline"

Paper No. 3 "Student Evaluations and Research Productivity and Eminence of College Faculty"

Paper No. 4 "Seniority Versus Merit in the Reward System of Academia"

Paper No. 5 "Class Size and Teaching Effectiveness"

Paper No. 6 "Further Considerations in the Relationship Between Teaching and Research"

CONCLUSIONS

RECOMMENDATIONS

FOOTNOTES

REFERENCES

APPENDICES

APPENDIX A: "STUDENT EVALUATIONS OF TEACHING: A COMPARISON OF SOCIOLOGY WITH OTHER DISCIPLINES"

APPENDIX B: "STUDENT EVALUATIONS AND RESEARCH PRODUCTIVITY AND EMINENCE OF COLLEGE FACULTY"

LIST OF TABLES

Table 1. Teaching Rating by Discipline

LIST OF FIGURES

Figure 1. Class Enrollment and Teaching Rating



INTRODUCTION 



This study analyzes and describes a great deal of data about two major
roles of college professors: (1) their performance in their classrooms and
teaching roles, and (2) their performance as researchers and scholars in their
disciplines. A major emphasis of the research is on the interrelationship
of these two roles. The study is based on a national sample of approximately
5,000 college and university teachers located at sixteen institutions of higher
education.

The original and major focus of the study was to determine empirically 
the direction and form of the relationship between quality of classroom per- 
formance and involvement in research and, if the data allowed it, to attempt 
to explain the relationship which emerged. Once the basic data set was com- 
piled for the sample, which consists of considerable detailed information 
about the individual faculty teaching performance, quantity and quality of 
their research output, and personal characteristics data, it was possible to 
investigate a series of additional questions of considerable import for admin- 
istration and planning in higher education. These questions include: 

1) Are there systematic differences between the teaching performances 
for the different academic fields and are such differences stable from one 
institution to the next? Are there characteristic differences between groups 
of disciplines (e.g., Social Sciences vs. Humanities) and are there stable 
differences within each of these broad groups (e.g., within Social Sciences, 
are political scientists rated more highly than historians as teachers, etc.?)? 

2) What are the determinants of successful classroom performance (as
measured through student evaluations of teaching) of college teachers? Is it
completely a matter of the personal characteristics, experience and training
of the individual instructor, such as rank, degree, years since completion of
the Ph.D., and the ranking of the institution from which he received his
highest degree? Or is the rating of a college professor primarily affected
by his position within the University structure (his field, level and type
of course, size of class, etc.)?

3) What is the precise relationship between enrollment (size of class)
and the evaluations of the instructor and the course? Are there critical
points in increasing size of classes at which the quality of the experience
changes sharply, or is it a continuous change with size? Is the relationship
of size and classroom performance linear or nonlinear? Separate substudies
that will be reported in the following sections deal with each of these issues.



PROCEDURES

Essential data for the study were already available although they had
been gathered originally for other purposes. Ratings of classroom performance
of teachers are derived from the series of "course critiques" published by a
large number of universities and colleges during the last five or six years.
These course critiques could be criticized on the grounds that they represent
only one perspective on classroom performance, i.e., the student's. However,
the few empirical studies of student ratings suggest that they have a high
degree of reliability, are relatively independent of the grades received by
the student (Voeks, 1962), but are related to the number of students who take
additional courses in a department, i.e., a larger proportion of students in
sections taught by professors with high ratings took additional courses in the
field (McKeachie, 1958; McKeachie and Kimble, 1968). One need not be committed
to the complete validity of student course evaluations to recognize their
importance as an index of student reactions to teaching. Without downgrading
other types of evaluation, composite judgements of large numbers of students
must be taken into consideration in evaluating teaching performance.

1 

The present study is based on data from sixteen schools. Essentially, 
this is an "availability" sample of universities selected on the basis of 
whether a course critique useful for our research was published. (If, for 
example, ratings were not quantified, we were not able to use them.) However, 
the sample includes within it various types of colleges and universities 
including large and small, public and private, prestigious and less distinguished 
institutions. Hence, findings of the study should provide a better basis for
generalization than any study done at a single school. The sample on the whole
is over-representative of larger and better known universities. The "bias"
of the sample may actually aid the validity of the study in this case. If
teaching and research are uncorrelated for this population of relatively "high
powered" institutions, then it is very unlikely that they would be correlated
in other institutions in which research is not a major goal.

The set of Teaching Evaluation questions on which students were asked to
rate professors and courses varied with each of the schools compared. Usually,
we had information on approximately ten items on each professor. To get an
overall rating of professors' classroom teaching performance, we combined the
average ratings on several of these items for each school. Although the
questions were not strictly identical, there was enough overlap in the content
to allow comparison between schools of an overall teacher's rating. Where
faculty were listed for more than one course, ratings were averaged. Some of
the individual items that went into the overall rating are also treated
separately in the analysis where possible, since they relate to substantially
different aspects of teaching performance such as instructors' "personalization"
and "course content."

Since the analysis to be reported involves the ratings of faculty in 16
different colleges, it was also necessary to convert all teaching scores to a
common unit of measurement. For this purpose, we carried out a z-score
standardization by subtracting the mean score for a college or university
from the raw score of each faculty member in that college and dividing the
resulting figure by the standard deviation of ratings for the entire college.
The resulting z-scores express the teaching rating of a faculty member in terms
of the number of standard deviations which he is above or below the mean for all
faculty in his college.
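
The standardization described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name, the record layout, and the sample ratings are hypothetical, and the report does not say whether a population or a sample standard deviation was used (the population form is assumed here).

```python
from collections import defaultdict
from statistics import mean, pstdev


def standardize_by_college(ratings):
    """Convert raw teaching ratings to within-college z-scores.

    `ratings` is a list of (college, faculty_id, raw_score) tuples.
    Returns a dict mapping (college, faculty_id) to a z-score.
    """
    # Group the raw scores by college so each college gets its own mean and SD.
    scores_by_college = defaultdict(list)
    for college, _faculty_id, raw in ratings:
        scores_by_college[college].append(raw)

    # Assumption: population standard deviation (pstdev), not the sample form.
    stats = {c: (mean(v), pstdev(v)) for c, v in scores_by_college.items()}

    z_scores = {}
    for college, faculty_id, raw in ratings:
        college_mean, college_sd = stats[college]
        # Guard against a college in which every rating is identical.
        z = (raw - college_mean) / college_sd if college_sd > 0 else 0.0
        z_scores[(college, faculty_id)] = z
    return z_scores


# Hypothetical ratings on two different scales (a 5-point and a 100-point scale).
sample = [
    ("College A", "a1", 3.0), ("College A", "a2", 4.0), ("College A", "a3", 5.0),
    ("College B", "b1", 60.0), ("College B", "b2", 80.0),
]
z_scores = standardize_by_college(sample)
```

Because each score is expressed relative to its own college, ratings collected on different scales become comparable, which is the purpose the text gives for the conversion.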

Data on the "nonteaching role" of each professor are limited to two 
measures of research and scholarship. 

A Publication Score was computed based on a weighted summary score for 

3 

articles and books written over an approximately 20 year period (Straus
and Radel, 1969). This score was limited to eight disciplines because of the
enormous amount of time required to complete the bibliographical search for
each discipline: one applied field, Engineering; one physical science,
Physics; one biological science, Biology; two humanities, English and
Philosophy; and three social sciences, Anthropology, Sociology and Psychology.






A Citation Score was based on the number of times a scholar's work was 
cited by others over a ten year period. It is felt to reflect the scholarly 
impact rather than output, or the amount of influence or salience which a 
scholar's work has. Thus, it is a more qualitative measure of scholarship 
than is the Publication Score. The data were obtained from the Science
Citation Index. This measure was limited to the fields for which publication
data were compiled and which were covered by the Index. Hence, English and 
Philosophy are omitted from the Citation Scores. 

A number of other variables included in the study as additional independent
variables and as control variables are course level, instructor's rank,
degree, years since completion of the Ph.D., course enrollment,
and prestige ranking of the institution from which the instructor received his
highest degree (Cartter Ratings). Data on these variables were obtained either from
the course critiques themselves or from a catalog of the institution.

Much of the original research period was consumed with assembling a 
massive amount of research data on approximately 5,000 faculty on 16 campuses, 
tabulating and coding it, and preparing it to be placed on tapes for 
computer analysis. This proved to be a monumental task with over one man-year 
spent on tabulating, coding and transcribing data alone.

RESULTS 



Results of this study are in the process of being communicated to the
higher education community in a series of six papers, each covering part of
the investigation to date. This section is divided into subsections. Two
papers have already been written from the project. The first paper, entitled
"Student Evaluations of Teaching: A Comparison of Sociology with Other
Disciplines," is in press (to appear in Teaching Sociology, Vol. 1, No. 1,
1972). (Originally read at the American Sociological Association Annual
Meetings, Sept. 1, 1970.)

A second paper entitled "Student Evaluations and the Research Produc- 
tivity and Eminence of College Faculty" has been completed and is in prepara- 
tion for submission for publication to the Bulletin of the American Association 
of University Professors. A late draft of this paper (which was also read 
before the American Sociological Association Annual Meeting, Denver, August 
29, 1971) is attached as Appendix A-2. 

This section summarizes the findings from those completed studies and 
indicates additional studies that are close to completion. 

Paper #1. "Student Evaluations of Teaching: A Comparison of Sociology
with Other Disciplines" (the abstract follows and the full paper is appended)
compares the students' evaluation of the classroom performance of faculty in
different broad fields of study. It describes the rank order of fields
according to student ratings and attempts to explain the order by consideration
of additional variables. Within the paper we considered sociology and psychology
separately because of the substantive focus of that particular paper
while grouping the remaining disciplines into five major categories: languages,
humanities, other social sciences, professional schools, and physical and
biological sciences.







To appear in Teaching Sociology 
Vol 1, No. 1, 1972 



AC-1 



STUDENT EVALUATIONS OF TEACHING: 

A COMPARISON OF SOCIOLOGY WITH OTHER DISCIPLINES* 



Arnold S. Linsky and Murray A. Straus 
University of New Hampshire 



Abstract 

This study investigated how sociologists were rated as teachers in comparison
with their colleagues in six other fields. Students' evaluations of
professors' classroom performance were obtained for 16 colleges and universities.
Ratings of classroom performance were derived from published course critiques.
In a comparison with seven fields, sociology shares the lowest ranking with
psychology. The rank of the other fields is (from highest to lowest): Languages,
Humanities, Other Social Sciences, Professional Schools, and Physical and
Biological Sciences. Several alternative explanations are examined to account for
this pattern: (1) larger class enrollments in sociology, (2) indefiniteness
of the content of sociology, (3) unmet student expectations with regard to the
goals of sociology, (4) lack of career relevance of sociology, and (5) selective
recruitment of highly alienated students. Some support was found for each
explanation, but none appears fully adequate based on the limited data available.
The concluding discussion suggests that teacher ratings based upon course
critiques confound within a single measure: (1) the students' evaluation of the
teacher's role performance and (2) the degree of role consensus between faculty
and students concerning the teacher-student relationship.






Paper #2, in preparation, describes the comparison of classroom ratings
for each separate discipline within the broad categories of the previous paper;
for example, English and Philosophy, Art and Classics are each compared within
the broad category of Humanities. Table 2 which follows provides separate teacher
ratings for thirty academic disciplines. The paper, which is planned for
publication in one of the journals which has a wide cross-disciplinary readership,
also examines some possible explanations for the ordering of fields discovered
by means of group level correlations. Table 1 which follows contains
preliminary data on evaluation of teachers by field from highest to lowest for
31 fields.

TABLE 1 about here 







PRELIMINARY DATA

TABLE 1. TEACHING RATING - BY DISCIPLINE
Rank Order (Z Scores)

                                      Teacher
                                      Rating
Discipline                    N       "Z"        SD

Botany                        24      +5.17      8.77
Other Languages               51      +5.08     12.34
Art                          150      +4.65      9.63
Genetics                       6      +4.33      8.94
Astronomy                      4      +3.25      5.85
History                      369      +3.04      8.72
Classics                      34      +3.03     13.39
European Languages           555      +2.93      9.20
Natural Sciences               7      +2.43     11.47
Philosophy                   146      +1.97      9.62
Physiology                     4      +1.75      9.29
English                      604      +1.68      8.99
Zoology                       54      +1.28     14.39
Anthropology                  84      +0.15      8.93
Biology & Microbiology        67      -0.21      8.18
Engineering                  156      -0.24      9.23
Political Science            223      -0.32      8.93
Business                     174      -0.33      9.37
Education                    175      -0.44     10.27
Language & Linguistics        23      -1.09      9.73
Geography                     82      -1.52      7.53
Chemistry                    192      -1.82      9.09
Economics                    165      -2.04      8.88
Sociology                    183      -2.63      9.14
Mathematics                  290      -2.73      9.25
Geology                       59      -2.81      9.54
Physics                      170      -2.81      9.35
Biochemistry                  35      -2.86      8.94
Psychology                   197      -3.45      8.86
Physical Science              24      -4.13      7.27
Meteorology                    5      -6.20     12.40






Paper #3, "Student Evaluations and Research Productivity and Eminence of
College Faculty," focuses on the major issue of the relationship of research
performance to classroom evaluation. An abstract follows and the detailed
data are contained in the appended paper.






STUDENT EVALUATIONS AND RESEARCH PRODUCTIVITY
AND EMINENCE OF COLLEGE FACULTY

Arnold S. Linsky and Murray A. Straus 
University of New Hampshire 

ABSTRACT 


The relationship of a professor’s classroom performance to his 
involvement in research has been vigorously disputed for some time, 
but discussions of this problem have been largely polemical. This 
study determines empirically the relationship of these two different 
dimensions of academic competence. 

There are at least three basic possibilities regarding the relationship
of teaching and research abilities: a) Teaching and research
are positively correlated, i.e., good researchers make good teachers
while poor researchers make poor teachers; b) Teaching and research
abilities are oppositely related, i.e., good researchers make poor teachers
and good teachers make poor researchers; c) Teaching and research
abilities are independent. There would seem to be an ample rationale
for both the positive and negative correlations between teaching and
research.

Essential data for the study were already available although they
had been gathered originally for other purposes. Ratings of classroom
performance of teachers are derived from "course critiques" published
by a large number of universities and colleges.

The present study is based on a national sample of sixteen colleges
and universities selected on the basis of whether a course critique
useful for our research was published. It includes a variety
of types of colleges and universities, including large and small, public
and private, prestigious and less distinguished institutions. Two
measures of the research and scholarship role are used:

a) A Publication Score was computed based on a weighted summary
score for articles and books written over an approximately
20 year period.

b) A Citation Score was based on the number of times a scholar's
work was cited by others over a ten year period. It is felt
to reflect the scholarly impact rather than output and is
based on data obtained from the Science Citation Index.

Overall Teacher Rating correlates only .04 with Total Publications 
Score indicating an almost total absence of relationship between these 
two major variables. Likewise the overall teaching score fails to show 
any correlation with separate components of total Publication Score such 
as books authored, articles written, etc. Citation score is correlated 
-.05 with overall teacher rating. 

Publication and citation do seem to be correlated with students' 
evaluation of the Instructor's Knowledge (as rated by students). The 
rating for Instructor's Knowledge is correlated .27 with Total Publica- 
tions. Research performance is also consistently correlated with students' 
evaluation of the Course Content, although correlations are small. 

Since research activity does not seem to be closely related to 
classroom performance several other factors which might influence class- 
room performance, as measured through student evaluation were considered 
in the second part of the paper. These are: 

1. Situational factors: There is a very small positive correlation
(.08, N = 4646) between teaching rating and course level, with more
advanced courses receiving more favorable ratings. Enrollment or class
size is somewhat negatively correlated with teaching rating. Ratings
also vary systematically by field. From these data it would appear that
teacher ratings are only partly due to individual differences in teaching
abilities and also vary with position within the social structure of
the University.

2. Personal Characteristics. Academic Rank is uncorrelated with
overall teaching score (r = .00, N = 3530). However, rank has a low positive
correlation with some of the separate components of the teaching score:
with students' evaluation of course content (r = .09, N = 905) and with
instructor's knowledge (r = .25, N = 1093). Highest degree held appears to
be almost unrelated to teaching rating (r = -.06, N = 2758). Classroom
performance also appears to decline over the course of a teacher's
career, as indicated by the correlation between teacher rating and years
since the Ph.D. (r = -.08, N = 1729). It must be kept in mind that these are
not longitudinal data, but reflect persons at different stages of their
careers at a single point in time.

This study found teaching, as measured through overall student 
evaluations, essentially uncorrelated with indices of research performance. 
Teaching ratings were found to vary systematically with certain personal 
and situational variables that were subsequently considered. The finding 
of lack of relationship between teaching and research has important im- 
plications for university recruitment and personnel planning. 
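
The abstract above reports its findings as correlation coefficients. It does not name the statistic, so the following Python sketch assumes the ordinary product-moment (Pearson) coefficient; the function name and the example values are hypothetical and are not the study's data.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length (at least 2 values)")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and squared deviations about the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / sqrt(sxx * syy)


# Made-up teaching ratings and publication scores for four faculty members,
# chosen so that the two variables are exactly uncorrelated.
teaching_rating = [1.0, 2.0, 3.0, 4.0]
publication_score = [1.0, -1.0, -1.0, 1.0]
r_teaching_research = pearson_r(teaching_rating, publication_score)
```

A coefficient near zero, such as the .04 reported for Overall Teacher Rating and Total Publications Score, indicates almost no linear association between the two measures. It does not by itself rule out a nonlinear relationship.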







Paper #4, the next substudy, "Seniority Versus Merit in the Reward
System of Academia" (abstract follows), replicates part of the major
study just described with data from a completely different population.
Professor Straus was able to obtain the data from a random sample of
the faculties of 50 colleges of agriculture. In this study the measure
of teaching effectiveness is the faculty member's self-perceived teaching
ability, in contrast to the student evaluations used in our own sample.
The overall finding of only a slight positive correlation between research
and teaching performance is consistent with the findings from our
sample of 16 colleges and universities, which further strengthens the
conclusion that there is little or no correlation between classroom
evaluation of teaching and research productivity. This finding appears
to hold regardless of which measure of teaching effectiveness is
utilized.






January 1972 






Abstract 

SENIORITY VERSUS MERIT IN THE REWARD SYSTEM OF ACADEMIA 

Murray A. Straus 
University of New Hampshire 

Data on the research productivity and self-perceived teaching ability
of a random sample of faculty in colleges of agriculture (N = 477) show
little tendency for these two aspects of the academic role to be associated
(r = .08). In addition, neither research performance nor teaching
performance is correlated with rank, though what little correlation there
is suggests that teaching is more highly rewarded than research (r = .07
versus -.07). Instead, the largest correlates of academic rank are
age (r = .58) and length of service at the college (r = .54). Neither
age nor length of service is an indicator of high performance as a
teacher or researcher (the correlation of age with research productivity
is -.17 and of age with teaching excellence .03; the correlation of length
of service is .05 with research productivity and -.11 with teaching).
Similar conclusions are reached if salary is used as the criterion of
reward in the academic system. However, those high in teaching performance
tend to differ from those high in research performance in certain
ways. Teaching performance is associated with career satisfaction but
research performance is not (r = .20 versus -.01). Those high in teaching
tend to focus their activities strictly within their college whereas those
high in research tend to belong to and attend state and national scientific
organizations. Finally, although high ability in teaching and research
does not, on the average, get reflected in higher rank or pay, there is some
tendency toward rational allocation of roles since those high in teaching
tend to spend a larger percentage of their time at teaching (r = .20).

The results of this study cast doubt on several academic myths,
including the student myth that teaching and research are inversely related,
the faculty myth that teaching and research are positively related, the
publish or perish myth, and the belief that American colleges are run on a
universalistic-competitive basis with the highest rewards going to those
who make the greatest contributions to teaching and research. If there is
any relation between teaching and research performance and reward in the
academic system, it probably applies only to the relatively few who are
either so outstandingly bad that they are eliminated from the system, or
that tiny minority who achieve national fame. The latter have high
visibility but are not numerous enough to alter the conclusion that the
American academic system bases its rewards primarily on seniority and
administrative contribution rather than achievement in teaching or research.







Paper #5, a fifth substudy entitled "Class Size and Teaching
Effectiveness at 16 Institutions of Higher Education" by Kenneth D.
Wood, undertook a detailed analysis of the relationship between class
size and student evaluations of teaching performance.

Figure 1 presents the main data. This analysis is still in 
preparation so that findings reported may be considered only as preliminary. 
The previous research studies of this issue indicated considerable 
inconsistency in their results. Many of the studies were based on com- 
parison of large and small classes at a single institution with some 
arbitrary cutting point for large vs. small classes. Cutting points 
varied between studies. Given the essentially curvilinear relationship 
found by Wood when all class sizes are considered, it is easy to see 
why there would tend to be inconsistency in previous studies which used 
different cutting points or considered only part of the range of class 
sizes. The left hand section of the graph indicates a negative correlation 
between class size and teacher rating. Average teaching evaluation is 
quite high for the very small enrollment courses and falls at a fairly 
constant rate. At class size 40 evaluation falls below average and con- 
tinues downward until 220. The curve then changes direction with classes 
of 300 and over receiving very favorable evaluations. 
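
One way to expose a curvilinear pattern of this kind, rather than summarizing it with a single correlation or one arbitrary cutting point, is to average the ratings within successive enrollment bands. The Python sketch below is illustrative only: the function name, the band width of 20 students, and the sample records are assumptions, not the procedure or data used by Wood.

```python
from collections import defaultdict


def mean_rating_by_enrollment(records, bin_width=20):
    """Average teaching rating within enrollment bands of `bin_width` students.

    `records` is an iterable of (enrollment, rating) pairs. Returns a dict
    mapping the lower bound of each band to the mean rating in that band,
    ordered from the smallest band to the largest.
    """
    totals = defaultdict(lambda: [0.0, 0])  # lower bound -> [sum, count]
    for enrollment, rating in records:
        lower = (enrollment // bin_width) * bin_width
        totals[lower][0] += rating
        totals[lower][1] += 1
    return {
        lower: total / count
        for lower, (total, count) in sorted(totals.items())
    }


# Made-up (enrollment, rating) pairs, purely to show the mechanics.
records = [(10, 1.0), (15, 0.0), (120, -0.5), (320, 0.8)]
band_means = mean_rating_by_enrollment(records)
```

Reading the band means in order of enrollment shows where ratings fall and where they recover, which a comparison of "large" versus "small" classes at one cutting point cannot show.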

One interpretation of these findings is that Professors "gear-up" 
to really large classes by tailoring their teaching methods especially 
for such classroom situations and so perform better than professors teach- 
ing one hundred students who continue to utilize teaching methods that 
are successful only in smaller classes. 






A second interpretation is that "star" classroom performers are
especially selected by their departments to fill the large class
situation; teachers with outstanding reputations may also be the "cause"
of the large class by drawing enrollments.

The study has many important implications for university planning 
for maximum effective use of faculty resources. Where enrollment 
responsibilities are high, it would seem to make more sense to run 
several courses in the very large size range and to utilize the faculty 
resources saved by providing classes under thirty as opposed to providing 
many classes at a uniform average size of 100-200. Apparently little 
would be lost in abandoning classes of this size. 






FIGURE 1. CLASS ENROLLMENT AND TEACHING RATING

[Figure not reproduced. Horizontal axis: class enrollment.]






Paper #6. The analysis for a final paper, which is in progress,
elaborates the relationship between different aspects of teaching
performance and research performance for all schools together. This
concluding paper will answer the following questions:

1. Does the relationship between teaching ratings and research
performance, which has so far been calculated only for the entire sample,
vary between (a) different groups of academic disciplines (social
sciences, humanities, physical sciences, applied sciences, etc.) and
(b) types of colleges and universities (large vs. small, high prestige vs.
less distinguished universities, schools with large graduate programs
vs. strictly undergraduate institutions, etc.)?

2. What is the form of some of the relationships that have been
analyzed so far only at the level of overall correlations? For example,
an overall inverse correlation between years since Ph.D. and teaching
ratings has been found. Based, however, on some hand-sorted data for
a single college, we suspect the relationship is actually curvilinear,
i.e., that classroom performance increases with the first few years,
plateaus, and then declines later. The relationship between classroom
teaching ratings and research performance may also be curvilinear.
We are testing for this possibility by construction of detailed
bivariate tables for teaching and research.







Different combinations of the two dimensions of teaching and 
research may be expressed in the following fourfold table: 



                                      Teaching Quality

                                High                    Low

Research and      High    (a) good teacher,       (b) poor teacher,
Scholarship                   good researcher         good researcher
Distinction
                  Low     (c) good teacher,       (d) poor teacher,
                              poor researcher         poor researcher



Descriptive profiles are being prepared of four academic types:
(a) the "academics," faculty who rank high on both teaching and
research; (b) the "researchers," who rank high on research but low on
teaching; (c) the "teachers," who rank high on teaching and low on
research; and (d) finally, those who rank low on both teaching and research.
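
The fourfold typology can be sketched in a few lines of Python. This is only an illustration: the report does not state where "high" and "low" are divided, so the sketch assumes a split at the sample median of each measure, and the function names and the sample data are hypothetical.

```python
from statistics import median

def classify(teaching, research, teaching_cut, research_cut):
    """Return the cell letter (a-d) of the fourfold table for one faculty member."""
    high_teaching = teaching > teaching_cut
    high_research = research > research_cut
    if high_teaching and high_research:
        return "a"  # the "academics": high on both
    if high_research:
        return "b"  # the "researchers": high research, low teaching
    if high_teaching:
        return "c"  # the "teachers": high teaching, low research
    return "d"      # low on both

def fourfold_counts(faculty):
    """Count faculty per cell; faculty is a list of (teaching, research) pairs.

    ASSUMPTION: the cutting points are the sample medians. The report does
    not say which cutting points were used.
    """
    teaching_cut = median(t for t, _ in faculty)
    research_cut = median(r for _, r in faculty)
    counts = {"a": 0, "b": 0, "c": 0, "d": 0}
    for teaching, research in faculty:
        counts[classify(teaching, research, teaching_cut, research_cut)] += 1
    return counts

# Made-up illustrative data: (teaching rating z-score, publication score)
sample = [(1.2, 10), (0.8, 0), (-0.5, 12), (-1.0, 1)]
print(fourfold_counts(sample))
```

With no relationship between the two measures and median cuts, the four cells would be expected to hold roughly equal numbers of faculty.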








CONCLUSIONS



Several conclusions are possible on the basis of the research accomplished
during this project. Because of the large number of professor and
classroom evaluations involved and the large number and variety of types of
institutions of higher education, these findings are much more definitive
than those of the few previous studies which have examined some of the same areas.

1) Our findings with regard to the major investigation of the project
were that there is only a slight or no correlation between teaching
effectiveness as measured by course critiques and scholarship as measured by
publications and citations (r = .04, N = 1422). This was true both for our
national sample, which employed student evaluations of teaching, and for a
separate sample of professors in schools of agriculture for which replication
was possible (r = .08, N = 477). For the latter group, self-evaluation of
teaching effectiveness by the professors was employed, thus strengthening
confidence in the findings.

2) Those professors who teach upper division and smaller enrollment
courses are more likely to receive favorable evaluations from students than
their colleagues in lower division and larger enrollment courses.

3) One of the earliest sub-studies in our investigation compared seven
disciplines or groups of disciplines on teacher ratings. Teachers in the
Languages received highest teacher ratings, followed by Humanities, Other
Social Sciences, Professional Schools, Physical and Biological Sciences,
Sociology, and Psychology last. This order was substantially stable for the
16 colleges and universities. From these data it would appear that teacher
ratings are only partly due to individual differences in teaching ability
and are also a positional attribute.

4) Academic rank is uncorrelated with teaching ratings but is correlated
with publication scores (r = .39, N = 1065) and citation scores (r = .24, N = 563).
This would seem at first glance to provide some support for the "publish or
perish" hypothesis of academic success. However, the strongest correlation
between rank and any other variable is with the number of Ph.D. years or
length of service (r = .63, N = 1735). Thus, even the group of highly
competitive universities and colleges represented in the sample primarily reward
people on the basis of seniority or length of service. A more accurate
description of the typical academic career than "publish or perish" would be
"remain and rise."

5) Classroom performance would appear to decline somewhat over the course
of a teacher's career, as indicated by the correlation between teacher rating
and Ph.D. years of -.08 (N = 1729). It must be kept in mind that this is not
longitudinal data but reflects persons at different stages of their careers
at a point in time.

6) The relationship between class size and teaching ratings appears to
be curvilinear. Average teaching evaluation is high for very small classes
but falls continuously until it reaches a low point at a little over 200
students. The curve then changes direction with classes of 300 and over
receiving very favorable evaluations.

RECOMMENDATIONS 

Although the findings are clear, the implications are conflicting given 
the dual commitment of at least most colleges and universities to both 
teaching and research activities.

One implication is that there would seem to be some value in developing
more independent reward systems for teaching and for research than currently
exist, since at best they are only weakly correlated. There is apparently
no reason why the same goals could not be attained by allowing a greater
"division of labor" with faculty specializing in either research or teaching
without career penalties.

A second and seemingly contradictory implication is that universities 
should hire primarily according to promise as a researcher. Such institutions 
will get a greater overall return for their money since such research produc- 
tive faculty are on the average at least as good teachers as those not par- 
taking in research. 

If the goal of an institution is only teaching, and research is seen as
either unimportant or as instrumental to producing good teaching, then there
is little point in taking research productivity into account in hiring or
promoting, since these data suggest that research is not necessary for good
teaching, at least as rated by students.

A further implication is that universities that are sufficiently compe- 
titive should recruit faculty who are outstanding in both teaching and research 
dimensions of the academic role since being good in one is certainly no bar 
to being good in the other respect. Since this combination of being high in 
both teaching and research is as common as any other combination of these 
traits, it should be well within the limits of possibilities. 

The relationship between class size and evaluation appears to be
curvilinear, with the very small (under 30) and the very large (over 300) classes
receiving the most favorable ratings. This suggests important implications
for university planning for maximum effectiveness in use of faculty resources.
Where enrollment responsibilities are high, it would seem to make more sense
to run several courses in the very large size range and to utilize the faculty
resources saved by providing classes under thirty as opposed to providing
many classes at the uniform average size of 100-200. Apparently little would
be lost in abandoning classes of this size.

Since the results of this research show that good teaching is partly a
function of structural factors within the University, efforts to improve college
teaching should not be limited to improving the skills of individual teachers
but should also focus on bringing about the conditions under which good
teaching is possible.






FOOTNOTES 

1. The schools are Boston College, Columbia College, Barnard, McGill
University, University of Southern California, Yale, University of Minnesota
(Duluth), SUNY (Buffalo), University of New Mexico, University of California
at Los Angeles, University of Texas (Austin), Ohio State University,
University of Washington, University of Utah, University of New Hampshire.

2. A copy of the specific questions used for each school may be obtained
from the National Auxiliary Publication Service by requesting document number
_____ and remitting $1 for a microfiche copy or $3 for photocopies to:
ASIS-National Auxiliary Publications Service, c/o CCM Information Sciences,
Inc., 22 West 34th Street, New York 10001.

3. An individual's publication score was computed as follows: 1 point for
each article (whether sole or joint author); 2 points for an edited book;
4 points for a jointly authored book; and 6 points for a solely authored book.
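
The weighting in footnote 3 amounts to a simple weighted sum. The sketch below is illustrative only: the point values come from the footnote, while the function name, the category labels, and the example counts are hypothetical.

```python
# Point values as stated in footnote 3; the dictionary keys are hypothetical labels.
POINTS = {
    "article": 1,      # each article, whether sole or joint author
    "edited_book": 2,
    "joint_book": 4,   # jointly authored book
    "sole_book": 6,    # solely authored book
}

def publication_score(counts):
    """Return the weighted publication score for a dict of publication counts."""
    unknown = set(counts) - set(POINTS)
    if unknown:
        raise ValueError(f"unknown publication types: {sorted(unknown)}")
    return sum(POINTS[kind] * number for kind, number in counts.items())

# Made-up example: five articles, one edited book, one solely authored book.
example = {"article": 5, "edited_book": 1, "sole_book": 1}
print(publication_score(example))  # 5*1 + 1*2 + 1*6 = 13
```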






REFERENCES 



McKeachie, Daniel S.
    1958  "Students' ratings of instructors: A validity study." Journal of
          Educational Research 51 (January): 79-83.

McKeachie, Wilbert J. and Gregory Kimble
    1968  Teaching Tips. Ann Arbor, Michigan: George Wahr Publishing Company.

Straus, Murray A. and David J. Radel
    1969  "Eminence, productivity, and power of sociologists in various regions."
          American Sociologist 4 (February): 1-4.







To appear in Teaching Sociology 
Vol 1, No. 1, 1972 



AC-1 



STUDENT EVALUATIONS OF TEACHING:
A COMPARISON OF SOCIOLOGY WITH OTHER DISCIPLINES*

Arnold S. Linsky and Murray A. Straus 
University of New Hampshire 

Abstract

This study investigated how sociologists were rated as teachers in
comparison with their colleagues in six other fields. Students' evaluations of
professors' classroom performance were obtained for 16 colleges and universities.
Ratings of classroom performance were derived from published course critiques.
In a comparison of seven fields, sociology shares the lowest ranking with
psychology. The rank of the other fields is (from highest to lowest): Languages,
Humanities, Other Social Sciences, Professional Schools, and Physical and
Biological Sciences. Several alternative explanations are examined to account for
this pattern: (1) larger class enrollments in sociology, (2) indefiniteness
of the content of sociology, (3) unmet student expectations with regard to the
goals of sociology, (4) lack of career relevance of sociology, and (5) selective
recruitment of highly alienated students. Some support was found for each
explanation, but none appears fully adequate based on the limited data available.
The concluding discussion suggests that teacher ratings based upon course
critiques confound within a single measure: (1) the students' evaluation of the
teacher's role performance and (2) the degree of role consensus between faculty
and students concerning the teacher-student relationship.

* * * * * * * * * * * * * * * * * * * * * * * * * * *







How are sociologists rated as teachers compared to those in other disci- 
plines? How are their courses rated by their students in comparison with 
courses in art, history, psychology, or chemistry? To what extent are the 
ratings of teachers related to the interest inherent in the subject? These are 
a few of the questions we attempted to answer in this study of student evalua- 
tions of sociology professors and others. 

The issue of the quality of classroom teaching of sociology has become
increasingly salient in the last few years. How good a job we are doing in
teaching sociology seems to be beset by paradoxically opposite developments.
On the one hand, we have criticism from many students about the seeming
irrelevance of what we are teaching, our preoccupation with theory and methods
and the development of the discipline, and our lack of personal and professional
involvement in processes of social change. Such criticism has not been limited,
of course, to our students. We have been similarly chastened by our fellow
sociologists about the poor and unimaginative quality of teaching of sociology
and offered suggestions for improving it (e.g., Apostle, 1968; Friedland, 1969).

On the other hand, enrollments in undergraduate sociology courses and the
number of majors have burgeoned in the last few years. This would seem to
indicate that students find the field of sociology attractive, if not its style of
promulgation in the classroom.

SOURCE OF DATA

Data for this report is taken from a larger study of academic competence,
investigating the relationship of a professor's classroom performance to his
involvement in research. The larger study involves a national sample comprised
of the faculties of sixteen universities covering almost all disciplines. This
is a convenience sample, selected on the basis of whether a course critique
useful for our research was published, i.e., critiques which included systematic,
quantitative ratings on professors and courses. The sample includes a wide
range of institutions, including large and small, public and private,
prestigious and undistinguished institutions, but is overrepresentative of larger and
better known universities.

From each of these course critique books, we extracted student evaluations
of professors' classroom performance individually and by discipline. Later, we
expect to correlate teaching performance ratings with other measures of academic
competence, such as research productivity and professional eminence. The present
study is limited to comparisons by discipline based on the teaching evaluations.

Course critiques or course evaluations have been published by a large 
number of universities and colleges during the last five to six years. Most 
state as their goals, providing objective information to aid students in the 
selection of courses and improvement of teaching through drawing attention to 
student views. Whether or not they have met these goals is unclear, but they 
have, at the minimum, provided interesting reading in many college communities. 

"Course critiques" have been criticized on the grounds that they represent 
only one perspective on classroom performance, i.e., the student's. However,
the few empirical studies of student ratings suggest that they have a high
degree of reliability, are relatively independent of the grade received by the
student (Voeks, 1962), but are related to the number of students who take
additional courses in a department, i.e., a larger proportion of students in
sections taught by professors with high ratings took additional courses in the
field (McKeachie, 1958; McKeachie and Kimble, 1968). One need not be committed
to the complete validity of student course evaluations to recognize their
importance as an index of student reactions to teaching. Without downgrading
other types of evaluation, the composite judgments of large numbers of students
must be taken into consideration in evaluating teaching performance.

While the critiques were originally intended to provide information on 
individual professors and courses, they also provide information on the compar- 
ative standing with students of the teaching within the different academic 
disciplines. 

The set of questions on which students were asked to rate professors and
courses varied with each of the schools compared. Usually we had information
on approximately ten items on each professor. To get an overall rating of a
professor's classroom teaching performance, we combined the average ratings on
several of these items for each school. Although the questions were not
strictly identical, there was enough overlap in the content to consider them
"conceptually equivalent" (Straus, 1969). However, to allow comparison
between schools of an overall teacher rating, it was also necessary to convert
all scores to a common unit of measurement. We carried out a z-score
standardization for this purpose by subtracting the mean for all courses in the
college from the mean for a specific department and dividing the resulting
figure by the standard deviation of ratings for the entire college. The
resulting z-scores express the average teaching rating of a department in terms
of the number of standard deviations by which the department average is above or
below the mean for all departments in the college.
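
The standardization just described can be sketched as follows. This is a minimal illustration, not the authors' own procedure: the text does not say whether a population or a sample standard deviation was used, so the population form is assumed here, and the function name and the ratings are hypothetical.

```python
from statistics import mean, pstdev

def department_z_score(department_ratings, college_ratings):
    """Standardize a department's mean rating against the whole college.

    z = (department mean - college mean) / college standard deviation

    ASSUMPTION: the population standard deviation (pstdev) is used; the
    text does not specify population versus sample form.
    """
    college_sd = pstdev(college_ratings)
    if college_sd == 0:
        raise ValueError("college ratings have zero variance")
    return (mean(department_ratings) - mean(college_ratings)) / college_sd

# Made-up ratings for every course in one college (mean 3.0, SD 1.0).
college = [2.0, 2.0, 4.0, 4.0]
# Made-up ratings for the courses of one department (mean 2.5).
department = [2.0, 3.0]
print(department_z_score(department, college))  # (2.5 - 3.0) / 1.0 = -0.5
```

A negative value, as in this example, means the department's average rating falls below the college mean.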

FINDINGS 

Table 1 indicates the relative ratings of several separate disciplines
and groups of disciplines for the sixteen universities and colleges.

Table 1 about here

In answer to the first question that we posed, sociologists are rated
below average as teachers by the students of fourteen of the sixteen colleges
and universities studied, as indicated by the negative z-scores. With an average
weighted z-score of -.33, sociology is among the two fields receiving lowest
scores on teaching, a nebulous distinction shared with psychology. At the other
end of the continuum, teachers of the languages received consistently favorable
ratings by students. They scored above average in all of the sixteen schools.
They are followed in rank order by the humanities, other social sciences,
professional schools, physical and biological sciences, sociology and psychology.

There appears to be some consistency between schools at the extremes of
the continuum. It is unclear, however, if the variability in rank order for
the intermediate disciplines represents real differences between schools or is
the product of small sample variability.

DISCUSSION 

Several alternative explanations for the pattern of findings are examined 
in this section although there is insufficient information to evaluate them 
fully. 

1. Enrollment and Ratings. One striking thing about the findings is that
the order of student ranking of teaching is either unrelated or inversely
related to enrollment trends in different disciplines. But student ratings may be
associated with differences in the average size of classes in different
disciplines. We suspected that students tend to regard instructors of small classes
more highly than they do instructors of large lecture classes to whom they may
never have spoken directly. We found an individual Pearsonian correlation of
-.10 between instructor rating and course enrollment (N = 4257, all disciplines
combined). Moreover, for various reasons, the average class size in sociology
and other social sciences tends to be large.3 Thus, the negative correlation
of class size with student ratings could be a factor underlying the low ratings
of instructors in sociology.
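
For readers who want to see how a correlation of this kind is computed, the following is a minimal sketch of the Pearson product-moment coefficient in Python. The function name and all data are made up for illustration and are not drawn from the study.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length, at least 2 items")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and of squared deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / sqrt(sxx * syy)

# Made-up data: ratings fall steadily as enrollment rises.
enrollment = [10, 20, 30, 40]
rating = [4.0, 3.0, 2.0, 1.0]
print(pearson_r(enrollment, rating))  # perfectly linear decline

# Made-up data: a symmetric U-shaped pattern.
size_band = [1, 2, 3, 4, 5]
u_rating = [2.0, 1.0, 0.0, 1.0, 2.0]
print(pearson_r(size_band, u_rating))  # close to zero
```

The second example shows why a weak overall coefficient should be read with care: a linear correlation can be near zero even when the two variables are strongly related in a curvilinear way.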



